The Information Manifold Approach to Data Integration

Author

  • Alon Y. Levy
Abstract

A data-integration system provides a uniform interface to a multitude of data sources. Consider a data-integration system providing information about movies from data sources on the World Wide Web. There are numerous sources on the Web concerning movies, such as the Internet Movie Database (which provides comprehensive listings of movies, their casts, directors, genres, and so forth), MovieLink (listing playing times of movies in US cities), and several sites that provide textual reviews for selected movies. Suppose we want to find which Woody Allen movies are playing tonight in Seattle and see their respective reviews. None of these data sources in isolation can answer this query. However, by combining data from multiple sources, we can answer queries like this one, and even more complex ones. To answer our query, we would first query the Internet Movie Database to obtain the list of movies directed by Woody Allen, and then feed the result into the MovieLink database to check which ones are playing in Seattle. Finally, we would find reviews for the relevant movies using any of the movie review sites. Most importantly, a data-integration system lets users focus on specifying what they want, rather than thinking about how to obtain the answers. As a result, it frees them from the tedious tasks of finding the relevant data sources, interacting with each source in isolation using a particular interface, and combining data from multiple sources.
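
The three-step plan described in the abstract (directors, then showtimes, then reviews) can be sketched roughly as follows. This is only an illustrative sketch, not the Information Manifold implementation: the in-memory tables, the function names, and every row of data are hypothetical placeholders standing in for the movie database, the showtime listing, and a review site.

```python
# Toy stand-ins for the three kinds of sources named in the abstract.
# All rows are invented placeholders, not real listings.
MOVIE_DB = [  # (title, director)
    ("Film A", "Woody Allen"),
    ("Film B", "Woody Allen"),
    ("Film C", "Someone Else"),
]
SHOWTIMES = [  # (title, city)
    ("Film A", "Seattle"),
    ("Film C", "Seattle"),
    ("Film B", "Portland"),
]
REVIEWS = {"Film A": ["Placeholder review text."]}  # title -> reviews


def movies_directed_by(director):
    """Step 1: ask the movie database for titles by a director."""
    return [title for title, d in MOVIE_DB if d == director]


def playing_in(titles, city):
    """Step 2: keep only the titles the showtime source lists for the city."""
    playing = {title for title, c in SHOWTIMES if c == city}
    return [title for title in titles if title in playing]


def reviews_for(titles):
    """Step 3: look up reviews for each remaining title."""
    return {title: REVIEWS.get(title, []) for title in titles}


def answer(director, city):
    """Chain the three sources; the caller states only what is wanted."""
    titles = movies_directed_by(director)
    playing = playing_in(titles, city)
    return reviews_for(playing)


if __name__ == "__main__":
    print(answer("Woody Allen", "Seattle"))
```

The caller of `answer` specifies only the director and the city; which sources are consulted, and in what order, stays hidden behind the function, which is the separation of "what" from "how" that the abstract emphasizes.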

Similar resources

A Geometry Preserving Kernel over Riemannian Manifolds

Abstract- Kernel trick and projection to tangent spaces are two choices for linearizing the data points lying on Riemannian manifolds. These approaches are used to provide the prerequisites for applying standard machine learning methods on Riemannian manifolds. Classical kernels implicitly project data to high dimensional feature space without considering the intrinsic geometry of data points. ...

Integration and Reduction of Microarray Gene Expressions Using an Information Theory Approach

The DNA microarray is an important technique that allows researchers to analyze many gene expression data in parallel. Although the data can be more significant if they come out of separate experiments, one of the most challenging phases in the microarray context is the integration of separate expression level datasets that have gathered through different techniques. In this paper, we prese...

Integration of exhaust manifold with engine cylinder head towards size and weight reduction

In this research, a new exhaust manifold and its cooling jackets is first designed for the integrated exhaust manifold into cylinder head (IEMCH) for a turbocharged engine. Then, the gas exchange and flow analysis is carried out numerically to evaluate the proper conditions for the exhaust gas and the coolant stream respectively. Finally, the entire engine parts are thermally analyzed to assure...

Critical Success Factors for Data Virtualization: A Literature Review

Data Virtualization (DV) has become an important method to store and handle data cost-efficiently. However, it is unclear what kind of data and when data should be virtualized or not. We applied a design science approach in the first stage to get a state of the art of DV regarding data integration and to present a concept matrix. We extend the knowledge base with a systematic literature review ...

Improving the Discriminative Model of Nonlinear Manifolds for Face Recognition with a Single Image per Person

Manifold learning is a dimension reduction method for extracting nonlinear structures of high-dimensional data. Many methods have been introduced for this purpose. Most of these methods usually extract a global manifold for data. However, in many real-world problems, there is not only one global manifold, but also additional information about the objects is shared by a large number of manifolds...

Towards an Interoperable Open GIS

The number of geo-science applications has been ever increasing over the last decades. Most of them, however, do not provide required level of data and systems integration and tend to be architecturally closed, monolithic, and costly environments. In this paper we present our approach to design and development of an open component-based Geographical Information System (GIS) architecture. The ov...

Publication date: 1998